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Summary 

Bovine coronavirus (BCoV) is the causative agent of diarrhea in newborn calves, winter dysentery in 
adult cattle, and respiratory tract illnesses in cattle across the world. In this study, a total of 190 fecal 
samples from dairy calves with diarrhea were collected from 14 farms in six Chinese provinces, and 
BCoV was detected in 18.95% (36/190) of the samples by reverse-transcriptase polymerase chain 
reaction. Full-length spike, hemagglutinin/esterase (FIE), nucleocapsid, and transmembrane genes 
were simultaneously cloned from 13 clinical samples (8 farms in 4 provinces), and most of the BCoV 
strains showed a unique evolutionary pattern based on the phylogenetic analysis of these genes. 
Interesting, 10 of the 13 strains were identified as FIE recombinant strains, and these strains had 
experienced the same recombination event and carried the same recombination sites located between 
the esterase and lectin domain. They also shared an identical aa variant (FI8IV) in the R2-loop. 
Moreover, 9/10 strains displayed another identical aa variant (P, S158A) in the adjacent Rl-loop of 
the FIE gene, which differs from the other available BCoV FIE sequences in the GenBank database. 
Our results showed that BCoV is widely circulating in dairy cattle in China, contributing to the 
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diagnosis and control of dairy calves diarrhea. Furthermore, a BCoV strain that carries a recombinant 


HE gene has spread in dairy calves in China. To the best of our knowledge, this is the first description 
of an HE recombination event occurring in BCoV; this is also the first description of the molecular 
prevalence of BCoV in China. Our findings will enhance current understanding about the genetic 
evolution of BCoV. 

Keywords bovine coronavirus, prevalence, hemagglutinin/esterase gene, recombination, dairy calves, 
China 


1. INTRODUCTION 

Bovine coronavirus (BCoV) is a lineage A member of the betacoronavirus genus. Other members 
include human OC43 coronavirus (HCoV-OC43), mouse hepatitis virus (MHV), equine coronavirus, 
porcine hemagglutinating encephalomyelitis virus, and canine respiratory coronavirus. BCoV, which 
causes diarrhea in newborn calves, winter dysentery in adult calves and respiratory tract illnesses in 
calves and adult cattle, inflicts in severe economic losses on the global farming industry (Azizzadeh et 
al., 2012; BOK et al., 2015; Johnson and Pendell, 2017). 

BCoV possesses five major structural proteins: the spike (S), hemagglutinin/esterase (HE), 
nucleocapsid (N), transmembrane (M) and the small membrane (E) (Lai and Cavanagh, 1997). The S 
protein is involved in receptor recognition and carries distinct functional domains near its amino (SI) 
and carboxy (S2) termini, while the N-terminal SI domains recognize sugar receptors, and the S2 
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subunit is a transmembrane protein that mediates viral and cellular membrane fusion during cell 
invasion (Fang Li, 2016). SI and S2 contain several antigenic domains, but SI appears to be the most 
efficient at inducing antibodies with high neutralizing activities in its host (Yoo and Deregt, 2001). 
The FIE protein contains two important functional domains: the lectin domain and the esterase 
domain. The lectin domain recognizes sugar receptors in the cell, whereas the esterase domain 
possesses a receptor-destroying enzyme activity capable of removing cellular receptors from the 
surfaces of the targeted cells. The receptor-binding (lectin) and receptor-destroying (esterase) domains 
may be important for virus entry (Kienzle et al., 1990; Schultze et al., 1991). Therefore, in addition to 
the S protein, the FIE protein serves as a second viral attachment protein for infection initiation 
(Groot, 2006). The primary role of BCoV N protein is to package the viral genome into long, flexible, 
helical ribonucleoprotein (RNP) complexes, protect the genome, and ensure its timely replication and 
reliable transmission, as well as playing a role in viral transcription and translation (Flurst et al., 

2010). In contrast, the M protein plays a crucial role in BCoV assembly (Oostra et al., 2006). 


The high genetic diversity in coronaviruses is attributable to the high mutation rates associated with 
RNA replication, the high recombination frequencies within the coronavirus family, and the large 
coronavirus genomes (Woo et al., 2009). Recombination in coronaviruses plays an important role in 
virus evolution, and can result in the emergence of new pathotypes (Wang et al., 2015; Menachery et 
al., 2017) as well as changing the host ranges and ecological niches (Bakkers et al., 2017). Thus far, 
recombination regions in coronaviruses have been extensively reported for the S gene (Lau et al., 
2011; Kin et al., 2015; Minami et al., 2016), a finding also applicable to BCoV (Martinez et al., 
2012). Recombination events in M (Flerrewegh et al., 1998), N (Kin et al., 2015), RP3 (Lau et al., 
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2010) and the ORF1 gene (Kin et al., 2015; Chen et al., 2017) have also been reported. However, to 


date, recombination events in HE have only been reported in MHV, a betacoronavirus, and this 
situation may act as a strong force for generating strains with new genotypes, host spectra and tissue 
tropisms (Luytjes et al., 1988; Smits et al., 2005; Groot, 2006). 

The presence of BCoV has been confirmed in Chinese dairy cows (GenBank accession number 
FJ556872), but the prevalence and molecular characteristics of BCoV are still largely unknown. 
Therefore, we sought to investigate the prevalence of BCoV in dairy calves with diarrhea in China. 
Unexpectedly, our results reveal that a BCoV containing a recombinant HE gene has emerged and 
spread in dairy calves in China. 

2. MATERIALS AND METHODS 

2.1. Fecal samples 

A total of 190 fecal samples were collected from dairy calves (<3 months of age) with obvious 
diarrhea at 14 farms from six provinces in China during September 2017 and May 2018 (Table 1). 
The samples were shipped on ice and stored at -80°C. 

2.2. RNA extraction and cDNA synthesis 

The fecal samples were fully resuspended in phosphate-buffered saline (1:5 w/v) and centrifuged at 
10,000 x g for 10 min. Viral RNA was extracted from 300 pi of the fecal suspension using RNAios 
Plus (TaKaRa Bio Inc., Japan) according to the manufacturer’s instructions. The cDNA was 
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synthesized using the PrimeScript™ RT Reagent kit according to the manufacturer’s instructions 
(TaKaRa Bio Inc.), and then stored at -20 °C until required. 

2.3. BCoV detection using polymerase chain reaction (PCR) 

BCoV nucleic acids in the fecal samples were identified using a PCR assay established in our 
laboratory that targets the BCoV polymerase gene. After validating the specificity and stability of the 
assay, the detection limit for the viral nucleic acid in the assays was determined to be 1 x 10 2 pg per 
pL-1. In detail, a primer pair (F: 5'- CGAGTTGAACACCC AGAT-3', R: 

5'-GAGACGGGCATCT AC ACT-3') were used to amplify a 230 nucleotide (nt) region of the 
polymerase gene (position 13,144 bp-13,373 bp) in the BCoV Mebus genome sequence, GenBank 
accession: U00735.2). The PCR mixture (25 pi) contained 1 pi cDNA, 1 pi forward primer (10 pM), 1 
pi reverse primer (10 pM), 12.5 pi PremixTaq (TakaRa Taq version 2.0 plus dye) and 9.5 pi 
nuclease-free water. The mixtures were PCR-amplified with 35 cycles at 94 °C for 5 min, 94 °C for 
30 s, 49 °C for 30 s and 72 °C for 30 s, with a final extension step at 72 °C for 10 min in an automated 
thermal cycler (BIOER, TC-96/G/H(b)c). 


2.4. PCR amplification of S, HE, N and M genes 

The complete S, HE, N and M genes were PCR-amplified from samples already known to be 
BCoV-positive based on RT-PCR assays previously reported (Gelinas et al., 2001; Park et al., 2006; 
Martinez et al., 2012; Lau et al., 2011). All PCR products were purified using the Omega Gel kit 
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(Omega, USA) following the manufacturer’s instructions, after which they were ligated to the 
pMD19-T vector (TaKaRa Bio Inc.) and transformed into DH5a competent Escherichia coli cells 
(Yeasen, China) for sequencing. The S and N gene sequences were assembled using SeqMansoftware 
(version 7.0; DNASTAR Inc., WI, USA). 

2.5. Sequence, phylogenetic, and recombination analyses 

The homologies of the nt and deduced amino acid (aa) sequences were determined using the 
MegAlign program in DNASTAR 7.0 software (DNASTAR Inc.). MEGA 7.0 was used for multiple 
sequence alignment and to subsequently build the maximum-likelihood phylogenetic tree with 
bootstrap testing (1,000 replicates). Recombination events were assessed using SimPlot software 
(version 3.5.1) and the Recombination Detection Program RDP 4.0 (version 4.9.5) with the RDP, 
GeneConv, Chimaera, MaxChi, BootScan, SiScan, and 3Seq methods (Martin et al., 2015). 


3. RESULTS 

3.1. BCoY detection in samples of fecal diarrhea from dairy calves 

Of the 190 fecal samples from the calves with diarrhea, 36 (18.95 %) were found to be BCoV positive, 
which revealed that the virus was distributed in 13/14 farms across the 6 provinces (Table 1). 
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3.2. PCR amplification of the S, HE, N and M genes from BCoV 


Full-length S, HE, N and M genes were successfully cloned out of 13 positive samples from 8 farms 
in 4 different Chinese provinces (Shanxi, 2 strains; Henan, 3 strains; Liaoning, 5 strains; and Sichuan, 
3 strains). 


3.3 . Molecular characterization of the S genes 

The 13 S genes, at 4,092 -bp each, encode a protein of 1,363 aa, the cleavage site of which is located 
at aa 768 in all 13. Sequence comparisons revealed that all 13 S genes share 98.6%-100% nt identity 
and 98.5%-100% aa identity with each other. They also share 96.8%-100% nt identity and 
95.3%-100 % aa identity with all 163 full-length BCoV S genes available in the GenBank database. 
A phylogenetic tree based on the complete S gene sequences using the maximum-likelihood method 
showed that 12 of the 13 S genes from this study together with 13 other BCoV S genes from China (1 
strain from cattle, GenBank accession number KU886291; 12 strains from Yaks, Bos grunniens, 
submitted by our team, GenBank accession number MH810151- MH810162) clustered on an 
independent large branch. The remaining S genes clustered with 3 North American BCoV strains 
(GenBank accession number MH043952, MH043954, and MH043955) on a small independent 
branch of the tree (Fig. 1). Compared with the other BCoV S genes, 9/13 sequences from this study 
and the 13 other Chinese BCoV sequences motioned above, which were located in the independent 
large branch, each had an identical aa variant (N1192Y) in the S2 subunit. Additionally, 4/13 
sequences from this study and the above-mentioned 12 sequences from Chinese Yaks, which are 
located in the large independent branch, have an identical aa variant (E12IV) in the SI subunit. 
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Compared with the BCoV Mebus prototype strain, these BCoV S genes have a total of 13 aa changes 


in the S1 subunit and 3 aa changes in the S2 subunit (Fig. 2). No frame shifts, deletions, insertions or 
recombination events were observed in the S gene sequences from all the strains in this study. 


3.4. Molecular characterization of the HE genes 

All 13 HE genes were 1,275 -bp long, and the protein they encode is 424 aa residues in length. FGDS, 
the putative esterase active site in all HE proteins was located at aa positions 37 to 40, and nine 
N-glycosylation sites were located at 54, 89, 104,153, 236, 301, 316, 358, 417 in the protein. 

Sequence comparisons of the HE genes revealed that the 13 strains shared 97.3%-100% nt sequence 
identity and 98.8%-100% aa identity between each other, and shared 96.1%-99.3% nt sequence 
identity and 93.2%-99.1% aa sequence identity with all 115 of the complete BCoV HE genes 
available in the GenBank database. A phylogenetic tree based on the full-length HE genes and the 
maximum-likelihood methodology showed that 10/13 of these genes together with 2 Chinese BCoV 
HE genes (1 strain from cattle, GenBank accession number KU886291; 1 strain from Yak, Bos 
grunniens, submitted by our team, GenBank accession number MH810163) clustered into a large 
independent branch. The remaining 3 HE genes clustered with HE genes of 3 North American BCoV 
strains (GenBank accession number FJ938065, AF230528, and AF391542) into a small independent 
branch (Fig. 3). 
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Interestingly, recombination analysis of the 13 complete HE genes showed that recombination events 


had occurred in 10 of the strains (GenBank accession number MK095135-MK095140, MK095142 
and MK095145-MK095147) from 8 farms in 4 provinces, as based on the RDP 4.0 (6 method) and 
SimPlot 3.5.1. The recombination breakpoint based on RDP 4.0 (6 method: RDP, GeneConv, 
Chimaera, MaxChi, SiScan, and 3Seq; recombinant score: 0.613) identified the beginning of the 
breakpoint at nt 150 in the fragment (breakpoint 99% confidence intervals: nt position 1-524 in the 
fragment) and nt 724 at the end of the breakpoint (breakpoint 99% confidence intervals: nt position 
544-869 in the fragment), in the putative major parental strain, KCD1 (GenBank accession number 
DQ389642), and the possible minor parental strain, LY-138 (GenBank accession number M84486). 
However, using SimPlot 3.5.1, the crossover site of the putative parental strains mainly mapped to nt 
168 at the beginning of the breakpoint and out nt 702 at the end of the breakpoint in the 1,275 bp 
sequence (Fig. 4). Although the recombination breakpoints predicted by RDP 4.0 and SimPlot differ, 
both programs showed that the recombination breakpoint was located between the esterase and lectin 
domain in HE (at nt position 70 bp-1,024 bp in the full-length BCoV HE gene). Sequence 
comparisons of the 10 strains in which the HE gene showed evidence of recombination revealed that 
the strains shared 98.4 %-100 % nt identity and 97.9 %-100 % aa identity between each other. In 
addition, 2 China strains (1 strain from cattle, GenBank accession number KU886291; 1 strain from 
Yak, GenBank accession number MH810163) that clustered with the 10 recombinant strains based on 
phylogenetic tree were identified as recombinant strains also, with the recombination event the same 
as that in this study. 
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Compared with the Mebus prototype strain, the 13 HE genes cloned in the present study each 


have an aa substitution (L5P) in the signal peptide, an aa substitution (N49T) in the putative esterase 
domain and an aa substitution in the membrane-proximal domain (S367P) (Fig. 5). No frame shifts, 
deletions or insertions were observed in the HE gene sequences in this study. Compared with all other 
available BCoV HE genes in the GenBank database, we found that the recombinant HE genes in the 
10 strains and the 2 Chinese BCoV HE genes (GenBank accession number KU886219, MH810163) 
that were also identified as HE recombination strains in this study, each have an identical aa variant 
(FI8IV) located in the R2-loop of the HE gene. Moreover, 9/10 strains with recombinant HE genes 
carried an identical aa variant (P, S158A) located adjacent to the Rl-loop in the HE gene; 
surprisingly, the aa residue at this site is identical to that in the HCoV-OC43 (Fig. 5). 

3.5. Molecular characterization of N and M genes 

All of the 13 N genes were 1,347 -bp in length, each encoding a protein of 230 aa residues. Sequence 
comparison of these genes revealed that they share 99.8 %-100 % nt sequence identity and 99.3 
%-100 % aa identity between each other, and share 96.4 %-99.6 % nt sequence identity and 96.8 
%-99.9 % aa identity with all 81 of the full-length BCoV N genes available in the GenBank database. 
The phylogenetic tree based on the complete N gene sequences and the maximum-likelihood method, 
showed that all 13 N genes, together with 1 Chinese strain from Yak (GenBank accession number 
MH810163), clustered on an independent large branch (Fig.6). Compared with other BCoV N genes, 
all 13 strains in this study and the Chinese strain (GenBank accession number MH810163) carry an 
identical aa variant (S416I) in the N gene. Compared with the Mebus prototype strain, the 13 strains 
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have 6 other aa changes (F15S, L53Q, M386T, M387I, S423I and Y441F) in the N gene. No frame 


shifts, deletions or insertions were observed in the N gene sequences from all strains in this study. 

In contrast, all 13 M genes were 690 -bp in length, each encoding a protein of 230 aa residues. 
Sequence comparisons of the M genes revealed that the 13 M genes shared 99.4 %-100 % nt identity 
and 99.6 %-100 % aa identity between each other, and shared 94.1 %-100 % nt identity and 94.4 
%-100 % aa identity with all 75 complete BCoV M genes available in the GenBank database. The 
phylogenetic tree based on the complete M gene sequences and the maximum-likelihood method 
showed that all 13 M genes, together with 4 recently reported M genes from North American strains 
(GenBank accession number MF1043952, MH043953, MH043954 and MF104955) fell into an 
independent large branch (Fig.7). Compared with the other BCoV M genes, all the strains in this study 
and the 4 North American strains (GenBank accession number MF1043952, MF1043953, MH043954 
and MH04955) share 2 identical aa variants (C126S, T175S) in the M gene. Compared with the M 
gene from the Mebus prototype strain, the 13 strains have 2 other aa changes (I38V, M221L) in the N 
gene. No frame shifts, deletions or insertions were observed in the N gene sequences from all the 
strains examined in this study. 


4. DISCUSSION 

Bovine coronavirus, an important pathogen of calves, is globally responsible for severe economic 
losses in farming (Azizzadeh et al., 2012; BOK et al., 2015; Johnson and Pendell, 2017). In China, the 
prevalence of BCoV is still largely unknown. Therefore, in this study, we screened 190 diarrhea fecal 
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samples from calves, 36 of which were found to be BCoV positive, and the positive samples were 


distributed across 13 of the 14 farms we screened across six provinces in the major dail y cattle 
production areas of China. The results showed that BCoV is circulating widely in these dairy cattle, a 
finding that should help with the diagnosis and control of diarrhea in these animals. Most of the 
strains from this study are unique in their evolutionary histories based our analysis of the full-length 
S, HE, N and M genes, a finding similar to that for BCoV in Korean (Park et al., 2010; Ko et al., 
2006). This may be the result of geographical, environmental and natural selection patterns (Hasoksuz 
et al., 2002; Martinez et al., 2012; Bidokhti et al., 2013). 

The BCoV S protein is involved in receptor recognition, host specificity, antigenic diversity, and 
immunogenicity (Fang Li, 2016). Its gene sequences are variable, and mutations in this protein are 
associated with alterations in viral antigenicity, viral pathogenicity, host range and tissue tropism 
(Gallagher and Buchmeier, 2001; Peng et al., 2012). In this study, compared with other BCoV S 
genes, we found that 9 out of 13 of our sequences and 13 Chinese BCoV sequences (1 strain from 
cattle and 12 strains from Yaks), which clustered on a large independent branch of the phylogenetic 
tree, each had an identical aa variant (N1192Y) in the S2 subunit. As a transmembrane protein, the S2 
subunit mediates the fusion of viral and cellular membranes (Luo and Weiss, 1998); hence, the 
biological significance of this variant warrants further investigation. In addition 4 out of 13 sequences 
and 12 sequences from Chinese Yaks were found to have an identical aa variant (E121V) in the SI 
receptor-binding region, compared with the other S genes. The SI subunit in the N-terminal of BCoV 
(aa 1 to 330) recognizes a sugar receptor (Peng et ah, 2012), and aa substitutions in this region can 
change the receptor binding capacity (Li et al., 2005) and host receptor specificity (Sheahan et al., 
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2008 ). Two BCoV strains (GenBank accession number MK095177, MK095178) from Shanxi 


Province were found to have unique aa substitutions (E1051V, S1076P) in the heptad repeat region. 
This region is crucial for viral entry and for viral and host cell membrane interactions to occur, and it 
promotes lipid bilayer fusion and nucleocapsid release into the cytoplasm (Forni et al., 2015). Thus, 
aa substitutions in this region may affect the interaction between the coiled-coil structure and the host 
cell receptor (Martinez et al., 2012). The HE protein has a receptor binding function, which also plays 
a critical role in the infection process of BCoV (Langereis et al., 2012). The BCoV HE 
receptor-binding region comprises six surface loops, among which five are grafted onto the 
beta-sandwich core of the lectin domain, named the Rl-loop, R2-loop, R3-loop, R4-loop and the 
RBS-haiipin, and another is present on the esterase domain (E-loop) (Langereis et al., 2012). In our 
study, 10 of the 13 strains were identified as having the same recombination events in the HE gene, 
located between the esterase and lectin region (Fig. 5), this recombination phenomenon may affects 
receptor binding capacity. 


The 10 strains in which recombination had occurred in the HE gene, and 2 other Chinese strains 
(GenBank accession number KU886219, MH810163), share the same recombination event that we 
identified in this study. Moreover, these strains each had an identical aa variant (F181V) in the 
R2-loop of the HE gene, while 9 of the 10 recombinant strains had another identical aa variant (P, 
S158A) in the adjacent Rl-loop of the HE gene and this variant differs from all other available BCoV 
HE sequences in GenBank, and these recombinant strains clustered into a large independent branch in 
the phylogenetic tree (Fig.3). The 10 recombinant strains were recovered from 8 farms in 4 provinces 
across a wide geographical distance, with the two furthest provinces being more than 1000 km apart. 


This article is protected by copyright. All rights reserved. 



Thus, these novel HE recombinant strains have been circulating widely in dairy cattle in China. To the 


best of our knowledge, this is the first report of a recombination event in the HE gene from BCoV in 
cattle, a finding that augments current understanding about the evolution of BCoV. In fact, MHV, 
which is another lineage A member of the betacoronavirus genus, has also reportedly undergone a 
nonhomologous recombination event in the HE gene (Luytjes et al., 1988). Recombination in the HE 
gene from Influenza C virus and in toroviruses has also been observed (Smits et al., 2005; Groot, 
2006). Recombination in the HE gene may be a strong driving force for generating strains with new 
genotypes, host spectra and tissue tropisms (Luytjes et al., 1988; Smits et al., 2005; Groot, 2006). 
Notably, in our study, the HE recombinant and non-recombinant strains simultaneously existed on the 
same cattle farm in Liaoning Province. 

Interestingly, the reported decrease in the HE receptor binding capacity of HCoV-OC43 
betacoronaviruses was thought to be caused by an accumulation of aa substitutions (T114N, R177P, 
E178Q, F247L) in the receptor binding region during the course of evolution, and that aa substitutions 
(D220Y, H) in the adjacent R3-loop can reduce the HE receptor binding capacity of BCoV (Bakkers 
et al., 2017). The most common betacoronavirus HE receptor is 9-O-acetylated sialic acid 
(9-O-Ac-Sia) (Matrosovich et al., 2015). However, the MHV-S receptor is 4-O-Ac-Sia, not 
9-O-Ac-Sia, and this change may be caused by aa substitutions and/or insertions in the 
receptor-binding region of the HE lectin domain, as compared with the Mebus BCoV prototype strain 
(Langereis et al., 2012). The HE protein lectin domain recognizes the sugar receptor, and the R2-loop 
in the lectin domain (residues 176-185) plays an important role in ligand binding; aa substitutions in 
this region may reduce receptor-binding activity (Bakkers et al., 2017). In the present study, all the 


This article is protected by copyright. All rights reserved. 



strains with recombination in HE share an identical aa variant (FI8IV) in the R2-loop of the HE gene. 


Furthermore, 9 of the 10 strains have another identical aa variant (P, S158A) in the adjacent Rl-loop 
of their HE genes, which is an identical situation to that seen with the HCoV-OC43 (Bakkers et al., 
2017). Thus, further investigation of the significance of the receptor binding capacity caused by aa 
substitutions in the receptor-binding region of the HE recombinant strains is warranted. 

Notable, monoclonal antibodies against the BCoV HE protein efficiently neutralized BCoV infectivity 
in vitro (Deregt and Babiuk, 1987), and protected the intestinal epithelium of cattle from virus 
infection in vivo (Deregt et al., 1989), indicating that the HE protein of BCoV may also play a 
significant role in the induction of protetive effect on BCoV infection besides S protein. In this study, 
high prevalence of the HE recombinant strain was found in China, which has a significantly 
implication on BCoV vaccine development subsequently in China. It is also interesting to further 
investigate the protective effect of the vaccine developed by BCoV prototype strain on the HE 
recombinant strain of BCoV. In this study, we made efforts to isolate the recombinant strain using 
HRT-8 cell. Unfortunately, we failed. It may be caused by repeated freeze-thaw of samples when they 
were used for detecting other enterovirus before the detection of BCoV. Now, we are monitoring the 
outbreak of diarrhea in calves and plan to re-collect diarrheic fecal samples to isolate the virus, in 
order to further investigate biological characteristics of the HE recombinant strain. 
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In conclusion, the results of our study have shown that BCoVs are circulating widely in dairy calves 


in China, and that most of these strains have unique evolutionary pattern based on our phylogenetic 
analysis of the complete S, HE, N and M genes. Recombination events between the esterase and lectin 
domain of HE were identified as occurring at remarkably high frequencies and these recombinant 
strains are widely prevalent in dairy cattle in China. As far as we are aware, this is the first description 
of a recombination event in the HE gene of BCoV, and our findings will enhance current 
understanding about the genetic evolution of BCoV. 
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Fig.l Phylogenetic tree based on the deduced 1,363 aa sequences of the complete S gene. Sequence 
alignments and clustering were performed by ClustalW in MEGA 7.0 software. The tree was 
constructed by the maximum likelihood method with bootstrap values calculated for 1,000 replicates. 
The strains in this study were marked with a circle, the other Chinese BCoV strains were marked with 
a triangle. 

Fig.2 Amino acid variants of the 13 complete S genes in this study. The figures in the box indicate the 
identical amino acid change sites in all 13 strains in this study compared with the BCoV prototype 
strain Mebus S sequences; the figure marked with triangle was an unique aa variant in the 4 sequences 
in this study and 12 sequences from Chinese Yaks; the figure marked with circular was an unique aa 
variant in the 9 sequences in this study and 13 sequences (12 from Chinese Yaks, 1 from Chinese 
cattle); the figure marked with line was an unique aa variant in Shanxi strains in this study; which 
compared with the other available BCoV S sequences in the GenBank database. SP: signal-peptide; 
Sl-NTD: receptor binding domain; S1A and SIB: the immune reactive domain; HP: the first 
hydrophobic domain of the S2 subunit; HR-N and HR-C: the heptad repeats. 

Fig.3 Phylogenetic tree based on the deduced 424 aa sequences of the complete HE gene. Sequence 
alignments and clustering were performed by ClustalW in MEGA 7.0 software. The tree was 
constructed by the maximum likelihood method with bootstrap values calculated for 1,000 replicates. 
The strains in this study were marked with a circle, the other Chinese BCoV strains were marked with 
a triangle. 

Fig.4 The recombination analysis of strain SWUN/SX2/2018 using SimPlot 3.5.1. A nucleotide (nt) 
identify plot comparing a 1,275 bp fragment of strain SWUN/SX2/2018 with BCoV strains KCD1 
(GenBank accession number DQ389642) and LY138 (GenBank accession number M84486) is 
shown. The putative recombination region is located at the 168 bp-702 bp. The vertical axis indicates 
the similarity (%) of nucleotide sequences between the query strain and other reference strains. The 
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horizontal axis indicates the nucleotide positions. SimPlot analysis was performed using a window 
size of 200 nt and step size of 20 nt (only one recombinant strain was showed due to typewritten 
pages requirement). 

Fig.5 Amino acid variants of the 13 complete HE genes in this study. The figures in the box indicate 
the identical amino acid change sites in all 13 strains in this study compared with the BCoV prototype 
strain Mebus HE sequence; the figure marked with triangle was an unique aa variant in the 9/10 
recombinant HE sequences and the figure marked with circular was an unique aa variant in the 10 
recombinant HE sequences in this study and 2 other Chinese strains (GenBank accession number 
KU886219, MH810163), which compared with the other available BCoV HE sequences in the 
GenBank database. SP: signal-peptide; MP: membrane-proximal domain; E: esterase domain; R: 
lectin domain. Recombination domain is located at E-loop, Rl-loop, R2-loop and R3-loop between 
esterase and lectin domain of HE. 

Fig.6 Phylogenetic tree based on the deduced 448 aa sequences of the complete N gene. Sequence 
alignments and clustering were performed by ClustalW in MEGA 7.0 software. The tree was 
constructed by the maximum likelihood method with bootstrap values calculated for 1,000 replicates. 
The strains in this study were marked with a circle, the other Chinese BCoV strains were marked with 
a triangle. 

Fig.7 Phylogenetic tree based on the deduced 230 aa sequences of the complete M gene. Sequence 
alignments and clustering were performed by ClustalW in MEGA 7.0 software. The tree was 
constructed by the maximum likelihood method with bootstrap values calculated for 1,000 replicates. 
The strains in this study were marked with a circle, the other Chinese BCoV strains were marked with 
a triangle. 
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Tablel.The result of BCoV detection in diarrhea fecal samples 


Province 

Number of 

farms 

Numbers 

of samples 

The positive 

rate% 

Sichuan 

4 

56 

16 

Shandong 

3 

23 

8.7 

Shanxi 

2 

40 

5.0 

Henan 

3 

15 

46.7 

Liaoning 

1 

18 

77.8 

Jilin 

1 

38 

5.3 

Total 

14 

190 

18.95 
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• MKD95172/SWUN/HN2/2018 

• MKD95173/SWUN/HN3/2018 

• MKD95171/SWUN/HN1/2018 

• MKD95165/SWUN/SX2/2018 

• MKD95164/SWUN/SX1/2018 
-• MKD95166/SWUN/LN1/2018 

• MK095167/SWUN/LN2/2018 

• MKD95168/SWUN/LN3/2018 

• MK095169/SWUN/LN4/2018 

• MKD95170/SWUN/LN5/2018 

• MKD95163/SWU/SC3/2017 
.• MWD95161/SWUN/SC1/2017 
I# MKO95162/SWUN/SC2/2017 

-▲ MH810163/HY24/CH/2017/YAK 

AF391541/BCoV-ENT/B ovine 
E F424624/US/OH3/2006/C alf- giraffe 
E F424623/US/OH3/2003/Giraffe 
EF424621/US/OH1/2003/Sable 
EF424615/E-AH65/B ovine 
-MH043952/4-17-03/Bovine 


MH043953/4-17-25/B ovine 

-MH043955/7-16-23/B ovine 

-FJ938066/US/OH-440-TC/1996/Bovine 

FJ425187/US/OH-WD470/1994/White-tailed deer 
DQ915164/AJpaca/B ovine 
-AF391542/BCoV-LUN/B ovine 


- EF424617/R-AH65/Bovine 


k KM985634/HLJ-14/CHN/Bovine 


MG518518/W17-18/Bovine 
EU401981/D502/B ovine 
EU401980/0501/Bovine 
F J425186/US/OH-WD358/1994/Waterbuck 
FJ425185/US/OH-WD358-GnC/1994/Waterbuck 
-▲ KU886219/BCV-AKS-01/B ovine 

I KT318094/BCoV/FRA/EPI/Caen/2014/12/Bovine 
KT318090/8CoV/FRA/EPI/Caen/2013/08/B ovine 
K5082264/8CoV2014/Bovine 
FJ938063/E-DB2-TC/Bovine 
DQ811784/DB2/Bovine 

FJ425190/US/OH-WD388-TC/1994/Sambar dee 
FJ425189/US/OH-WD388/1994/Sambar dee 

EU401984/SUN5/B ovine 
EU401983/A3/Bovine 

EU401965/BC94/Bovine 
AB354579/Kakegawa/Bovine 
j A FJ556872/YC/Bovine 
IAF058942/LY-138/Bovine 
AF058944/OK-0514-3/Bovine 

761U00735/ Mebus/B ovine 
AF220295/Quebec/B ovine 

-EF193074/V270/B ovine 

-EF193073/L9/B ovine 


- AF058943/LSU-94LSS-051 -2/Bovine 


- NC 005147/OC43/Human 
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• MKD96159/SWUN/HN2/2018 

• MKD95160/SWUN/HN3/2018 

• MKD95158/SWUN/HN1/2018 

• MKD95155/SWUN/LN3/2018 

• M KD95154/SWU N/LN2/2018 

• MK095153/SWUN/LN1/2018 

• MKD95152/SWUN/SX2/2018 

• MKD95151/SWUN/SX1/2018 

• MKD95150/SWUN/SC3/2017 

• MKD95149/SWUN/SC2/2017 

• MKD95148/SWUN/SC1/2017 
MH043955/7-16-23/Bovine 
MH043952/4-17-03/Bovine 
MH043954/4-17-08/Bovine 
MH043953/4-17-25/Bovine 

I • MKD95156/SWUN/LN4/2018 
6sl • MKD95157/SWUN/LN5/2018 
AH014872/KWD7/B ovine 
AH014870/KWD5/B ovine 
FJ938065/AH187/Bovine 

FJ425187/US/OH-WD470/1994/White-tailed deer 
EF424624/US/OH3/2006/Calf-giraffe 
EF424623/US/OH3/2003/Calf-giraffe 
EF424621/US/OH1/2003/Sable antelope 
EF424616/E-AH65-TC/B ovine 
EF424618/R-AH65-TC/Bovine 
DQ915164/Alpaca/Bovine 
AF391542/BCoV-LUN/B ovine 

-AF391541/BCoV-ENT/B ovine 

-▲ KM985633/HLJ-14/CHN/B ovine 

EU019216/Bubalus/ltaly/179/07-11/Bovine 
KX982264/BCoV 2014/Bovine 

-EF371515/KV0501/Bovine 

▲ KU886219/BCV-AKS-01/Bovine 
AF058943/LSU-94LSS-051 -2 /Bovine 
FJ425190/US/OH-WD388-TC/1994/Sambar deer 
FJ425189/US/OH-WD388/1994/Sambar deer 
FJ425186/US/OH-WD358/1994AVaterbuck 
FJ425185/US/OH-WD358-GnC/1994/Waterbuck 
DQ811784/DB2/Bovine 

FJ425188/US/OH-WD388-TC/1994/Sambar deer 
FJ938063/E-DB2-TC/B ovine 

-▲ MH810163/H Y24/CH/2017/YAK 


- AB354579/Kakegawa/Bovine 


U00735/Mebus/Bovine 
AF220295/Ouebec/Bovine 
M27474/matnx protein/Bovine 
AF058942/LY-138/Bovine 


- NC 005147/OC43/Human 
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